Accommodating error analysis in comparison and clustering of molecular fingerprints.
Abstract
Molecular epidemiologic studies of infectious diseases rely on pathogen genotype comparisons, which usually yield patterns comprising sets of DNA fragments (DNA fingerprints). We use a highly developed genotyping system, IS6110-based restriction fragment length polymorphism analysis of Mycobacterium tuberculosis, to develop a computational method that automates comparison of large numbers of fingerprints. Because error in fragment length measurements is proportional to fragment length and is positively correlated for fragments within a lane, an align-and-count method that compensates for relative scaling of lanes reliably counts matching fragments between lanes. Results of a two-step method we developed to cluster identical fingerprints agree closely with 5 years of computer-assisted visual matching among 1,335 M. tuberculosis fingerprints. Fully documented and validated methods of automated comparison and clustering will greatly expand the scope of molecular epidemiology.
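The align-and-count idea in the abstract can be illustrated with a minimal sketch. It is not the authors' published implementation. It assumes that each lane is a list of fragment lengths, that proportional error can be handled by comparing log-lengths against a fixed tolerance, and that the lane-wide correlated error behaves like a single scale factor searched over a small grid. The function name `count_matches` and the parameters `tol`, `max_scale`, and `steps` are hypothetical, and their default values are illustrative only.

```python
import math


def count_matches(lane_a, lane_b, tol=0.02, max_scale=0.05, steps=21):
    """Largest number of one-to-one fragment matches over trial lane scalings.

    All parameter names and defaults are illustrative assumptions:
    tol       -- allowed difference between log-lengths (relative error)
    max_scale -- largest log-scale shift tried between the two lanes
    steps     -- number of trial shifts (must be at least 2)
    """
    # Proportional error becomes additive error on a log scale.
    log_a = sorted(math.log(x) for x in lane_a)
    log_b = sorted(math.log(x) for x in lane_b)
    best = 0
    for k in range(steps):
        # Trial shift applied to lane B, spanning -max_scale .. +max_scale.
        shift = -max_scale + 2 * max_scale * k / (steps - 1)
        i = j = matched = 0
        # Greedy two-pointer pass over the sorted log-lengths.
        while i < len(log_a) and j < len(log_b):
            diff = log_a[i] - (log_b[j] + shift)
            if abs(diff) <= tol:
                matched += 1
                i += 1
                j += 1
            elif diff < 0:
                i += 1  # fragment in A is too short to match; advance A
            else:
                j += 1  # fragment in B is too short to match; advance B
        best = max(best, matched)
    return best


if __name__ == "__main__":
    lane_a = [1000, 2000, 4000, 8000]
    # Same fingerprint, but every fragment in the lane reads 3% long.
    lane_b = [x * 1.03 for x in lane_a]
    print(count_matches(lane_a, lane_b))
    print(count_matches(lane_a, lane_b, max_scale=0.0))
```

With the scale search disabled (`max_scale=0.0`), a uniform 3% stretch of one lane defeats a 2% tolerance for every fragment; allowing the lanes to be rescaled before counting recovers the matches, which is the effect the abstract attributes to compensating for relative scaling.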
Related resources
Determination of the Best Hierarchical Clustering Method for Regional Analysis of Base Flow Index in Kerman Province Catchments
The lack of complete coverage of hydrological data forces hydrologists to use the homogenization methods in regional analysis. In this research, in order to choose the best Hierarchical clustering method for regional analysis, base flow and related index were extracted from daily stream flow data using two parameter recursive digital filters in 43 hydrometric stations of the Kerman province. Ph...
An algorithm for clustering cDNA fingerprints.
Clustering large data sets is a central challenge in gene expression analysis. The hybridization of synthetic oligonucleotides to arrayed cDNAs yields a fingerprint for each cDNA clone. Cluster analysis of these fingerprints can identify clones corresponding to the same gene. We have developed a novel algorithm for cluster analysis that is based on graph theoretic techniques. Unlike other metho...
Selective in-vitro Enzymes’ Inhibitory Activities of Fingerprints Compounds of Salvia Species and Molecular Docking Simulations
Recently Nutrition and Food Chemistry researches have been focused on plants and their products or their secondary metabolites having anti-alzheimer, anti-cancer, anti-aging, and antioxidant properties. Among these plants Salvia L. (Lamiaceae) species come into prominence with their booster effects due to high antioxidant contents, which have over 900 species in the world and 98 in Turkey. Some...
An Improved SSPCO Optimization Algorithm for Solve of the Clustering Problem
Swarm Intelligence (SI) is an innovative artificial intelligence technique for solving complex optimization problems. Data clustering is the process of grouping data into a number of clusters. The goal of data clustering is to make the data in the same cluster share a high degree of similarity while being very dissimilar to data from other clusters. Clustering algorithms have been applied to a ...
Journal: Emerging Infectious Diseases
Volume: 4
Publication date: 1998